NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses

Stapleton, Logan; Taylor, Jordan; Fox, Sarah; Wu, Tongshuang; Zhu, Haiyi. (July 2023, ICML Workshop)

Large generative AI models (GMs) like GPT and DALL-E are trained to generate content for general, wide-ranging purposes. GM content filters are generalized to filter out content which has a risk of harm in many cases, e.g., hate speech. However, prohibited content is not always harmful -- there are instances where generating prohibited content can be beneficial. So, when GMs filter out content, they preclude beneficial use cases along with harmful ones. Which use cases are precluded reflects the values embedded in GM content filtering. Recent work on red teaming proposes methods to bypass GM content filters to generate harmful content. We coin the term green teaming to describe methods of bypassing GM content filters to design for beneficial use cases. We showcase green teaming by: 1) Using ChatGPT as a virtual patient to simulate a person experiencing suicidal ideation, for suicide support training; 2) Using Codex to intentionally generate buggy solutions to train students on debugging; and 3) Examining an Instagram page using Midjourney to generate images of anti-LGBTQ+ politicians in drag. Finally, we discuss how our use cases demonstrate green teaming as both a practical design method and a mode of critique, which problematizes and subverts current understandings of harms and values in generative AI.
more » « less
Strategic Instrumental Variable Regression: Recovering Causal Relationships From Strategic Responses

Harris, Keegan; Ngo, Dung Daniel; Stapleton, Logan; Heidari, Hoda; Wu, Steven (July 2022, Proceedings of the 39th International Conference on Machine Learning)

Full Text Available
Imagining new futures beyond predictive systems in child welfare: A qualitative study with impacted stakeholders

https://doi.org/10.1145/3531146.3533177

Stapleton, Logan; Lee, Min Hun; Qing, Diana; Wright, Marya; Chouldechova, Alexandra; Holstein, Ken; Wu, Zhiwei Steven; Zhu, Haiyi (June 2022, 2022 ACM Conference on Fairness, Accountability, and Transparency)

Full Text Available
“Why Do I Care What’s Similar?” Probing Challenges in AI-Assisted Child Welfare Decision-Making through Worker-AI Interface Design Concepts

https://doi.org/10.1145/3532106.3533556

Kawakami, Anna; Sivaraman, Venkatesh; Stapleton, Logan; Cheng, Hao-Fei; Perer, Adam; Wu, Zhiwei Steven; Zhu, Haiyi; Holstein, Kenneth (June 2022, Designing Interactive Systems Conference)

Full Text Available
Who Has an Interest in “Public Interest Technology”?: Critical Questions for Working with Local Governments & Impacted Communities

https://doi.org/10.1145/3500868.3560484

Stapleton, Logan; Saxena, Devansh; Kawakami, Anna; Nguyen, Tonya; Ammitzbøll Flügge, Asbjørn; Eslami, Motahhare; Holten Møller, Naja; Lee, Min Kyung; Guha, Shion; Holstein, Kenneth; et al (November 2022, Companion Publication of the 2022 Conference on Computer Supported Cooperative Work and Social Computing (CSCW 2022))

Full Text Available
Improving Human-AI Partnerships in Child Welfare: Understanding Worker Practices, Challenges, and Desires for Algorithmic Decision Support

https://doi.org/10.1145/3491102.3517439

Kawakami, Anna; Sivaraman, Venkatesh; Cheng, Hao-Fei; Stapleton, Logan; Cheng, Yanghuidi; Qing, Diana; Perer, Adam; Wu, Zhiwei Steven; Zhu, Haiyi; Holstein, Kenneth (April 2022, Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems)

Full Text Available
How Child Welfare Workers Reduce Racial Disparities in Algorithmic Decisions

https://doi.org/10.1145/3491102.3501831

Cheng, Hao-Fei; Stapleton, Logan; Kawakami, Anna; Sivaraman, Venkatesh; Cheng, Yanghuidi; Qing, Diana; Perer, Adam; Holstein, Kenneth; Wu, Zhiwei Steven; Zhu, Haiyi (April 2022, Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems)

Full Text Available
Soliciting Stakeholders’ Fairness Notions in Child Maltreatment Predictive Systems

https://doi.org/10.1145/3411764.3445308

Cheng, Hao-Fei; Stapleton, Logan; Wang, Ruiqi; Bullock, Paige; Chouldechova, Alexandra; Wu, Zhiwei Steven; Zhu, Haiyi (May 2021, CHI '21: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems)
null (Ed.)
Recent work in fair machine learning has proposed dozens of technical definitions of algorithmic fairness and methods for enforcing these definitions. However, we still lack an understanding of how to develop machine learning systems with fairness criteria that reflect relevant stakeholders’ nuanced viewpoints in real-world contexts. To address this gap, we propose a framework for eliciting stakeholders’ subjective fairness notions. Combining a user interface that allows stakeholders to examine the data and the algorithm’s predictions with an interview protocol to probe stakeholders’ thoughts while they are interacting with the interface, we can identify stakeholders’ fairness beliefs and principles. We conduct a user study to evaluate our framework in the setting of a child maltreatment predictive system. Our evaluations show that the framework allows stakeholders to comprehensively convey their fairness viewpoints. We also discuss how our results can inform the design of predictive systems.
more » « less
Full Text Available
An Algorithmic Framework for Fairness Elicitation

https://doi.org/10.4230/LIPIcs.FORC.2021.2

Jung, Christopher; Kearns, Michael; Neel, Seth; Roth, Aaron; Stapleton, Logan; Wu, Zhiwei Steven (January 2021, 2nd Symposium on Foundations of Responsible Computing (FORC 2021))

We consider settings in which the right notion of fairness is not captured by simple mathematical definitions (such as equality of error rates across groups), but might be more complex and nuanced and thus require elicitation from individual or collective stakeholders. We introduce a framework in which pairs of individuals can be identified as requiring (approximately) equal treatment under a learned model, or requiring ordered treatment such as "applicant Alice should be at least as likely to receive a loan as applicant Bob". We provide a provably convergent and oracle efficient algorithm for learning the most accurate model subject to the elicited fairness constraints, and prove generalization bounds for both accuracy and fairness. This algorithm can also combine the elicited constraints with traditional statistical fairness notions, thus "correcting" or modifying the latter by the former. We report preliminary findings of a behavioral study of our framework using human-subject fairness constraints elicited on the COMPAS criminal recidivism dataset.
more » « less
Full Text Available

Search for: All records